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Abstract 

Background: There is accumulating evidence that circadian disruption, mediated by alterations in melatonin levels, 
may play an etiologic role in a wide variety of diseases. The degree to which light-at-night (LAN) and other factors 
can alter melatonin levels is not well-documented. Our primary objective was to evaluate the degree to which 
estimates of outdoor environmental LAN predict 6-sulftoxymelatonin (aMT6s), the primary urinary metabolite of 
melatonin. We also evaluated other potential behavioral, sociodemographic, and anthropomorphic predictors of 
aMT6s. 

Methods: Study participants consisted of 303 members of the California Teachers Study who provided a 24-hour 
urine specimen and completed a self-administered questionnaire in 2000. Urinary aMT6s was measured using the 
Buhlmann ELISA. Outdoor LAN levels were estimated from satellite imagery data obtained from the U.S. Defense 
Meteorological Satellite Program's (DMSP) Operational Linescan System and assigned to study participants' 
geocoded residential address. Information on other potential predictors of aMT6s was derived from self- 
administered surveys. Neighborhood socioeconomic status (SES) was based on U.S. Census block group data. 

Results: Lower aMT6s levels were significantly associated with older age, shorter nights, and residential locations in 
lower SES neighborhoods. Outdoor sources of LAN estimated using low-dynamic range DMSP data had insufficient 
variability across urban neighborhoods to evaluate. While high-dynamic range DMSP offered much better variability, 
it was not significantly associated with urinary aMT6s. 

Conclusions: Future health studies should utilize the high-dynamic range DMSP data and should consider other 
potential sources of circadian disruption associated with living in lower SES neighborhoods. 
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Background 

There is growing evidence that environmental light pol- 
lution may play an etiologic role in a variety of diseases, 
including depression, cardiovascular disease, and cancer 
[1-10]. Such health effects are largely thought to be me- 
diated via circadian disruption driven by alterations in 
melatonin production and secretion [11,12]. Melatonin, 
an endogenous hormone produced by the pineal gland, 
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is the primary hormonal modulator of circadian regula- 
tion in mammals and is strongly influenced by exposure 
to visible light, with levels peaking during the darkness of 
night [13]. Predictors of melatonin levels in humans gen- 
erally have not yet been well- characterized [11,14-18]. 
While there is fairly strong evidence that indoor artificial 
light-at-night (LAN) is sufficient to suppress melatonin 
levels in humans, it is not known whether outdoor envir- 
onmental light pollution is sufficient to exert a similar 
effect [1,3,19]. Fully understanding such predictors is a ne- 
cessary prerequisite for implementing epidemiologic stud- 
ies on this topic. 
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The primary objective of the current analyses was to 
evaluate the degree to which estimates of outdoor 
environmental light-at-night (LAN) predict levels of 
6-sulfatoxymelatonin (aMT6s), the primary urinary 
metabolite of melatonin, among 303 California women. 
Secondarily, we evaluated whether selected anthropo- 
morphic characteristics and behavioral and sociodemo- 
graphic factors predict urinary concentrations of aMT6s. 

Results 

Creatinine adjusted urinary aMT6s concentrations ranged 
from 0.58 ng/mg-creatinine to 102.06 ng/mg-creatinine 
with a mean of 20.66 ng/mg-creatinine, a standard devi- 
ation of 17.78 ng/mg-creatinine and an interquartile range 
of 17.74 ng/mg-creatinine. 

The characteristics of the study population are sum- 
marized in Table 1. Similar to the entire CTS cohort, the 
study population is predominantly non-Hispanic white 
(85%) and middle-aged (mean age = 55 years). Despite 
arising from an occupational cohort that requires at least 
a 4-year college degree among its members, study partic- 
ipants lived in both high and low SES neighborhoods, 
although skewed towards higher SES neighborhoods. By 
design, study participants were oversampled from rural 
neighborhoods (60.4%). 

Table 2 summarizes estimates of outdoor LAN expo- 
sures. The average length-of-night was 11 hours, reflecting 
the fact that most urine specimens were collected in the 
winter and early spring months. The radiance estimates of 
outdoor LAN ranged from 4 to 63, with a mean value of 
43.4, for the low-dynamic range 2000 data, and 3.8 to 
465.2, with a mean of 128.4, for the high-dynamic range 
2006 data. Figure 1 illustrates the greater range and vari- 
ability in radiance estimates offered by the high-dynamic 
range data. More importantly, Figure 1 illustrates that the 
low-dynamic range data lacks sufficient variability at the 
upper range to discriminate areas with higher LAN values 
within urban neighborhoods. In addition, LAN values 
were universally higher among women living in urban 
compared to rural neighborhoods, with no overlap in 
values across rural and urban neighborhoods. In contrast, 
the high-dynamic range data offers variability in values 
within both rural and urban areas as well as some overlap 
in values across urban and rural areas. Consequently, we 
chose to rely on the high-dynamic range data for the re- 
mainder of our analyses. 

The results from our random forests analyses to iden- 
tify the most important predictors of urinary aMT6s 
concentrations, based on the scaled variable importance 
values (mean divided by the standard deviation), identi- 
fied the following 8 variables as potentially important 
predictors of aMT6s: hormone therapy use; oral contra- 
ceptive use; length-of-night; coffee consumption; strenu- 
ous exercise in the past 3 years; age; menopausal status; 



and neighborhood SES. Overall, however, none of these 
variables, individually or combined, explained a substan- 
tial proportion of the variability in measured aMT6s 
levels (data not shown). 

Prior to conducting our regression analyses, we exam- 
ined the shape of the relationship between SES and 
aMT6s. A plot of the smoothed aMT6s on the first com- 
ponent of our SES PCA (Figure 2) demonstrates that 
increases in melatonin are seen with increases in SES 
but only among those living in neighborhoods for which 
the PCA value for SES did not exceed zero. Once the 
summary neighborhood SES reaches the average value 
(represented by zero), the relationship flattens out. Con- 
sequently, in our regression model selection process, we 
represented the summary SES measure as a continuous, 
piecewise linear function that has one join point at zero 
and is constrained to have zero slope when the PCA 
value exceeds zero. 

Model selection by backwards elimination, starting with 
the 8 variables identified through our random forests ana- 
lyses (see above), identified length-of-night, SES, and age 
as the only significant predictors of aMT6s (Table 3). 
Levels of aMT6s declined with older age and increased 
with longer nights and higher SES. Among the subset of 
women over the age of 55 years, SES was not a significant 
predictor of aMT6s while length-of-night and age became 
stronger and more significant predictors. Overall, these 
models explained very little of the variance in aMT6s 
(R 2 for all women = 0.03; R 2 for women > 55 years = 0.07). 

Although the outdoor LAN variable was not selected 
through our stepwise model building approach, because 
it was of primary interest to us, we evaluated it separ- 
ately in age-adjusted regression models and found it to 
have a very small inverse association with aMT6s that 
was not significant in either the full study population 
OS =-0.0028, se = 0.008, p = 0.73) or among those over 
age 55 Q3 = -0.0062, se = 0.012, p = 0.62). 

Discussion 

Overall, results from our study failed to identify factors 
that substantially predict urinary measures of aMT6s. 
The only factors that were significantly related to aMT6s 
levels were age, length-of-night, and SES and together 
they explained very little of the variance in aMT6s levels. 
Nevertheless, our study does provide some important 
findings relevant to future investigations of health out- 
comes related to melatonin and/or LAN exposures. 

There is increasing interest in the use of satellite im- 
agery data to evaluate potential effects of outdoor envir- 
onmental light pollution in wildlife and in human 
populations, including a number of studies that have 
investigated potential links to cancer incidence [20-22]. 
Our analyses demonstrated that the widely-available 
low-dynamic range annual satellite imagery data are 
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Table 1 Characteristics of study population (n = 303) 



Characteristics 


Distribution 


Age at urine collection, years 


55.0 (1 1.9) 


(mean,standard deviation) 




BMI at urine collection, kg/m 2 


26.6 (5.8) 


(mean,standard deviation) 




1— Ipinht f hs^plinp ir\/p\/ 


65.0 (2.7) 


inches (mean,standard deviation) 
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4 0 (10 01 


Pmyears or aspirin use (mean.standard deviation) 


0.69 (2.0) 


Coffee consumption, average grams/day 


238.2 (184.7) 


(mean.standard deviation) 




Coffee consumption, average times/day 


0.97 (0.72) 


(mean.standard deviation) 




Strenuous exercise, 3-years prior to baseline, 


1 .6 (2.4) 


hours/wk (mean.standard deviation) 




Strenuous exercise, lifetime prior to baseline, 


2.2 (2.3) 


hours/wk (mean.standard deviation) 




Age at first full-term pregnancy, years 


27.0 (4.9) 


(mean.standard deviation) 




r\ .■_ j_ l l £ l ■ i .-ii i • . i 

Parity, total number of live and still births 


1.7 (1.4) 


(mean.standard deviation) 




Race/ethnicity (n,%) 




Non-Hispanic white 


258 (85.1) 


Hispanic 


18 (5.9) 


Black 


3 (1.0) 


Asian & Pacific Islander 


1 3 (4.3) 


Other/unknown 


1 1 (3.6) 


Menopausal Status, at baseline(n,%) 




Postmenospausal 


131 (43.2) 


Premenopausal or Perimenopausal 


147 (48.5) 


Unknown 


25 (8.3) 


Alcohol Consumption, year prior to baseline (n, %) 




None 


86 (28.4) 


<20 g/day 


176 (58.1) 


>20 g/day 


27 (8.9) 


Unknown 


14 (4.6) 


Tobacco Smoke Exposure, at baseline (n, %) 




None 


61 (20.1) 


Passive only 


142 (46.9) 


Former active smoker 


86 (28.4) 


(~ \ irrpnt ar~l~i\/P cmok'Pr 


12 C4 01 


Unknown 


2 (0.7) 


Oral contraceptive use, at baseline (n, %) 




Never 


81 (26.7) 


Former 


1 98 (65.4) 


Current 


16(5.3) 


Unknown 


8 (2.6) 


Antidepressant use, daily use for at least 2 months 




at baseline 





Table 1 Characteristics of study population (n = 303) 

(Continued) 



Yes 


29 (9.6) 


No 


237 (78.2) 


Unknown 


37 (12.2) 


Hormone therapy use, at baseline (n, %) 




Premenopausal 


142 (46.9) 


Peri/post-menopausal, never used 


34 (11.22) 


Peri/post-menopausal, prior use 


16(5.3) 


Peri/post-menopasual, current use 


71 (23.4) 


Other/unknown 


40 (13.2) 


Regular aspirin use, years at baseline 




None 


240 (79.2) 


<1 
1 


1 1 (3.6) 
2 (0.7) 


2 


4(1.3) 


3-4 


7 (2.3) 


5-9 


6 (2.0) 


10+ 


31 (10.2) 


Unknown/missing 


2 (0.7) 


Regular aspirin use, days/week at baseline 




None 


240 (79.2) 


1-3 


41 (13.5) 


4-6 


9 (3.0) 


/ 


1 2 (4.0) 


Unknown 


1 (0.3) 


Daily Use of Calcium Blockers, in 2000 




Yes 


4(1.3) 


No 


246 (81.2) 


Unknown 


53 (17.5) 


Neighborhood Urbanization 




Urban 


120 (39.6) 


Rural 


\ OS (60.4) 


Hours of sleep/night 




<5 hours 


3 (1.0) 


5-6 hours 


63 (20.8) 


7-9 hours 


226 (74.6) 


10+ hours 


4(1.3) 


Unknown 


7 (2.3) 


Neighborhood Socioeconomic Status 
(mean, range) 




Median Family Income ($) 


76,782 (37,337- 
140,387) 


Percent of population below poverty level 


6.8 (0.0 - 37.6) 


Percent of adults age 25+ with college degree 


34.3 (1.5 - 79.0) 


Percent of adults without a high school degree 


14.6 (1.2 - 56.8) 


Percent of adults employed in a professional 
occupation 


44.3 (6.7-74.3) 
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Table 2 Estimates of outdoor LAN among study 
participants (n = 303) 



Outdoor LAN satellite values 
(annual average radiance) 

Low-dynamic range, 2000 average (DN units) 

High-dynamic range, 2006 average 
(scaled radiance units) 



Mean (Range) 



43.4 (4 - 63) 

1 284 (3.8 - 465.2) 



Length-of-night on day of urine collection (hours) 1 1 (9.2 - 14.4) 



insufficient for distinguishing areas with differing LAN 
values within suburban and urban areas. Since many 
health outcomes, including many cancers, are more com- 
mon in urbanized areas [23-27], use of the low-dynamic 
range satellite data will likely be inadequate for investigat- 
ing LAN exposures within such areas. While the high- 
dynamic range data are currently only available for 2006, 
efforts are underway to develop such high-dynamic range 
data for other years [28]. Researchers interested in this 
topic would be prudent to pursue such data. 

The other novel finding generated by our analyses was 
the significance of neighborhood SES. Our analysis 
showed that aMT6s concentrations tended to be low 
among women living in the lowest SES areas and in- 
creased with greater neighborhood SES up to a point 
after which the concentration leveled off and possibly 
declined slightly at the very highest levels of SES. Fur- 
thermore, this relationship, while statistically significant in 
the full study population, appeared to be of lesser import- 
ance in the women over age 55. Overall, very little is 
known about how melatonin varies by SES. Similar to our 
findings, a study of approximately 200 Canadian women 
reported that urinary measures of aMT6s were lower 
among the lowest educated women (i.e. high school dip- 
loma or less), highest in the middle category of education, 
with a slight decrease in the highest educated group [29]. 
In contrast, Burgess and colleagues in their evaluation of 
predictors of salivary measures of melatonin found educa- 
tion was not a significant predictor [30] . 

Interpretation of our findings on SES can only be 
speculative. It is possible that the neighborhood measure 
of SES is serving as a proxy for a number of behavioral 
and/or anthropomorphic factors that influence mela- 
tonin levels. We did, however, incorporate variables in 
our model selection process for many of the most im- 
portant of these factors including alcohol consumption, 
smoking, BMI, use of exogenous hormones and other 
medications, coffee consumption, and physical activity - 
none of which were independent and significant predic- 
tors of aMT6s. Another possibility is that low SES 
neighborhoods may have environmental stressors (e.g., 
noise, crime) other than light pollution that we did not 
measure in our study but could disrupt circadian 



rhythms and lead to lower levels of aMT6s. Given that a 
number of the health outcomes of high interest with re- 
spect to circadian disruption display strong SES-related 
risk gradients, research into other features of the built en- 
vironment represents an important area for future inquiry. 

In addition to SES, age and length-of-night were the 
only other significant predictors of aMT6s identified in 
our analysis. These findings are generally consistent with 
the limited body of literature on this topic. Two studies 
have reported seasonal differences in melatonin levels 
with higher levels associated with greater length-of-night 
[15,17] and lower levels in the summer months [17], 
although two other studies reported no relationship with 
month [18] or season [31]. Most studies have reported 
declines in melatonin levels with increasing age, albeit 
most have reported this relationship to be linear 
[15,17,18,31-33]. Consistent with our results, two studies 
noted more dramatic age-related declines among older 
individuals [34,35]; conversely, a few other studies sug- 
gested that the age-related declines in melatonin are 
greater, or are limited to, early adulthood and then level- 
off [18,36-38]. Measurements of melatonin in older 
adults living under controlled light-dark conditions in 
one study reported no age-related declines in melatonin 
levels, leading the authors to suggest that changes in 
sleep behaviors might be responsible for the age-related 
declines reported in other observational studies [39]. In 
our study, however, we observed no differences in sleep 
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Figure 1 Marginal distribution of Outdoor LAN estimates and 
distribution by urban classification for the low-dynamic range 
(year 2000) and the high-dynamic range (year 2006) data. The 

units in the "low dynamic range" data are "DN" units, while the units in 
the "high dynamic range data" are scaled radiance units (see Methods). 
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First component of SES PCA 

Figure 2 Estimated relationship between neighborhood SES and aMT6s, along with the pointwise 95% confidence band (shaded in 
gray). The relationship increases until SES equals zero, at which point it flattens out and no statistically significant slope remains (the dashed 
vertical line indicated where the SES PCA component is equal to zero). The lines across the bottom indicate the SES values for the sample. 
Note that the spline only shows deviation of aMT6s relative to the average aMT6s value across the sample. 



duration by age group (data not show). Our results, in 
the context of the somewhat conflicting literature on 
this, underscore the need to carefully consider the shape 
of the age-related risks when conducting health-related 
research on this topic. The fact that we observed a shift 
in the relationship between age and aMT6s around age 55 
suggests this may be especially important when evaluating 
health risks in women, whose risks often change in their 
early- to mid-50s after menopause. This may be particu- 
larly relevant to breast cancer which exhibits a different 
constellation of risk relationships in pre- and post- 
menopausal women and for which LAN exposures have 
been postulated to be a potential risk factor. 

Our evaluation of outdoor LAN, which showed a very 
modest inverse, but not statistically significant, relation- 
ship to aMT6s concentrations, was hindered by a 
number of obstacles. While we used the best satellite 



imagery data available to objectively estimate outdoor 
LAN values, because the high-dynamic range data was 
only available for 2006, the LAN estimates were not 
temporally congruent with the urine in which the 
aMT6s was measured (collected in 2000). An examin- 
ation of the low-dynamic range data for all years span- 
ning this time period (2000-2006), however, suggested 
relative stability in LAN values over this time for the 
study area. Furthermore, the Spearman rank correlation 
between the 2000 low-dynamic range data and the 2006 
high-dynamic range data was 0.96. 

Another reason for the lack of an association seen be- 
tween outdoor LAN and urinary measures of aMT6s 
could be that while the satellite imagery data may be an 
adequate predictor of outdoor ambient light, it may not 
reflect light exposures experienced at night when partici- 
pants are likely to spend the majority of their time 



Table 3 Regression coefficients (p) for final model estimating log 2 creatinine-adjusted aMT6s 





Variable 


(3, standard error 


t statistic (2-sided p-value) 


All Subjects 


Length-of-night 


0.1081, 0.0612 


1 .76 (0.08) 




Age 


-0.01 1 3, 0.0066 


-1.79 (0.09) 




SES 


0.1375, 0.0637 


2.16 (0.03) 


Ages > 55 years 


Length-of-night 


0.2341, 0.0963 


2.43 (0.02) 




Age 


-0.0284, 0.0152 


-1.87 (0.06) 
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indoors. Intervening factors, such as the use of blinds/ 
curtains, time spent indoors, other sources of indoor 
light, etc. all are likely contributors to LAN exposures. 
The importance of this is underscored by findings from a 
recent study that compared estimates of LAN from satel- 
lite imagery data to calibrated photometric measurements 
obtained from personal monitors and reported that satel- 
lite imagery data did not correlate with personal photo- 
metric measurements [40]. Thus, it is important that 
studies aimed at evaluating outdoor LAN exposures do so 
with full consideration of these other factors. 

Finally, there are a number of limitations in the esti- 
mates of our melatonin levels that are worth noting. The 
urine specimens were collected over a 24-hour time 
period. This precludes our ability to examine the timing 
of peak melatonin concentration which is likely an im- 
portant factor in circadian disruption [13]. Furthermore, 
while there is a good deal of evidence that first morning 
urine captures peak night-time melatonin excretion, the 
degree to which 24-hour urine specimens reflect this is 
not well-documented [13,31]. The aMT6s assay, how- 
ever, is a well-validated biomarker, demonstrating good 
correlation with serum melatonin levels [13], and suffi- 
cient stability over time [13,15,31,41,42] to serve as a 
reasonable estimate of chronic levels. 

Conclusions 

There is growing evidence that circadian disruption, me- 
diated by alterations in melatonin secretion driven by 
night-time light exposures may play an etiologic role in 
a large array of diseases. The use of satellite imagery data 
to estimate ambient measures of LAN offers an innova- 
tive and inexpensive source of data to test emerging hy- 
potheses on this topic. The results from our analyses 
highlight some important cautionary limitations to the 
use of such data as well as point to some additional ave- 
nues of pursuit, including the elucidation of factors asso- 
ciated with neighborhood SES that may play a role in 
circadian disruption. 

Methods 

Study participants 

Study participants consisted of 303 members of the 
California Teachers Study (CTS) residing in the San 
Francisco Bay Area, who as part of a special substudy 
conducted in 2000, completed a self-administered ques- 
tionnaire, an in-person interview, and supplied a 24- 
hour urine sample. The CTS is a prospective cohort 
study initiated in 1995, consisting of 133,479 female pro- 
fessional California school employees for whom exten- 
sive information has been collected through a baseline 
questionnaire and three subsequent mailed question- 
naires [43]. Members of the special substudy included a 
random sample of 528 CTS participants who were 



< 85 years old when the cohort was established in 1995, 
and who resided in the substudy area (comprised of 
western Alameda, Santa Clara, San Mateo, Santa Cruz, 
Monterey, and northern San Benito counties). Forty-four 
(8%) of these women were not contacted because they 
had died, moved out of the substudy area, or could not 
be located. Of the 484 women invited to participate, 328 
agreed, of whom 303 provided a 24-hour urine sample. 
Further details of the special substudy can be found else- 
where [44]. The use of human subjects was reviewed 
and approved by the California Health and Human Ser- 
vices Agency, committee for the Protection of Human 
Subjects and the Cancer Prevention Institute of 
California's Institutional Review Board. 

Data and specimen collection 
Specimen collection 

Participants were given urine collection kits and instruc- 
ted to collect all urine produced over the next 24-hour 
period. Participants were asked to use cold packs to keep 
the sample chilled and, if willing, to refrigerate it during 
the 24-hour collection period. After 24-hours, staff col- 
lected and stored the samples at -20° Celsius until they 
could be delivered to our laboratory (within 2 weeks) 
where it was thawed, aliquoted into 10 mL vials, and 
frozen at -70" Celsius. Prior to aliquoting, total urine 
volume was measured and recorded. 

Laboratory methods 

Urinary concentration of aMT6s was measured by 
Pacific Toxicology Laboratory (Chatsworth, CA, USA) 
using Biihlmann aMT6s enzyme-linked immunosorbent 
assay (ELISA) kits purchased through ALPCO (Salem, 
NH, USA). The aMT6s ELISA is a competitive immuno- 
assay using a capture antibody technique that has a 
lower detection limit of 0.8 ng/mL and an analytic high 
sensitivity limit of 40 ng/mL. The functional sensitivity 
of the assay was 1.5 ng/mL, while the intra-and inter- 
assay precision were 7.1% and 11.9%, respectively. Urin- 
ary creatinine levels were obtained from laboratory 
assays using a modified-rate Jaffe method conducted as 
part of a prior analysis by the University of Alabama, 
Birmingham in 2000. aMT6s levels were creatinine- 
adjusted by dividing the aMT6s concentration by the 
creatinine concentration. 

Four samples were below the detection limit and five 
samples exceeded the analytic high sensitivity limit (3% 
of samples total). Using PROC MI with a TRANSFORM 
LOG(var) statement in SAS 9.3 we imputed the values 
below and above the limit of detection, setting range pa- 
rameters of 0-0.8 ng/mL and 40-60 ng/mL respectively. 
The upper range of 60 ng/mL for those samples above 
the analytic high sensitivity limit was based upon upper 
levels observed in 24-hour samples from women in a 



Hurley et al. International Journal of Health Geographies 2013, 12:39 
http://www.ij-healthgeographics.eom/content/12/1/39 



Page 7 of 9 



previous study [18]. We used the first imputation to as- 
sign values of aMT6s to these nine participants. 

Outdoor environmental LAN data 

Information on outdoor exposures to ambient LAN was 
derived from nighttime satellite imagery data obtained 
from the U.S. Defense Meteorological Satellite Program's 
Operational Linescan System (DMSP-OLS) [45]. This 
database contains annual composites, made after exclud- 
ing the outer quarters of the satellite swath, light from 
the sun and moon, glare, clouds, and atmospheric 
lightning. Ephemeral events, such as fires, are also 
discarded. While these images capture only a fraction of 
the light originating from the earth's surface, they 
accurately represent the relative levels of nighttime illu- 
mination at ground level [45]. The imagery data is 
georectified to a 30 arc-second grid (equivalent to 
approximately one-square kilometer). Two kinds of radi- 
ance information were available: adaptive gain, low 
dynamic range data; and fixed-gain, high dynamic range 
data. The low dynamic range data are available for mul- 
tiple years, but consist of unit-free (called DN, or 
"Digital Number" units) data with a dynamic range of 
six bits, i.e., integers ranging from 0 (background noise) 
to 63. The fixed gain, high dynamic range data, is avail- 
able only for 2006 and is obtained by combining infor- 
mation from several fixed-gain sensors. It consists of 
floating point numbers ("scaled radiance" units) that, if 
needed, can be scaled by 1.51586 x 10" 10 to obtain data that 
are watts/sr/cm 2 (see the F16_2006_radiance_readme.txt 
file in reference [45] for caveats with respect to this scale 
factor). Such scaling was not performed for these analyses. 
Residences from study participants were assigned the aver- 
age nighttime radiance value for the grid cell in which they 
were located using spatial analysis tools available in the 
raster, rgdal, and sp packages within R [46]. Initially we 
assigned to residences both the low-dynamic range data 
for 2000 and the high-dynamic range data for 2006. 

As an additional measure of LAN we generated a 
"length-of-night" variable, created by linking the date of 
urine collection with daily sunrise and sunset data for 
San Francisco for the year 2000 obtained from the Naval 
Observatory [47]. From these data we calculated the 
hours of darkness on the day of urine collection. 

Anthropomorphic and behavioral factors 

Information on anthropomorphic and behavioral factors 
of interest was derived from the CTS surveys. Factors 
were considered as potential predictors for these ana- 
lyses based on a review of the limited literature on this 
topic [15-18,31-35,37,38,42,48-51] and included: age, 
race/ethnicity, body mass index (BMI), height, parity, 
number of live births, age at first full-term pregnancy, 
pack-years of smoking, alcohol consumption, oral 



contraceptive use, hormone therapy use, menopausal 
status, strenuous physical activity, average hours of sleep 
per night, age at menarche, coffee consumption, and the 
use of aspirin, calcium blockers and antidepressants. 

Sociodemographic data 

Information on neighborhood socioeconomic character- 
istics and urbanization were derived from U.S. Census 
2000 data. We used the ArcGIS, version 9.2 (ESRI, 
Redlands, CA, USA) to geocode the participants' home 
addresses to both a latitude/longitude and to a Census 
2000 block group. Data on socioeconomic status (SES) 
included: percentage of adults over age 25 years having 
completed a college degree or higher; percentage of 
adults without a high school degree; income (median 
family income), occupation (percentage of adults emplo- 
yed in managerial/professional occupations), and poverty 
(percentage of population below the poverty line). The 
degree of neighborhood urbanization was characterized 
as either urban/suburban or rural, based on a pre- 
viously-developed algorithm using a combination of cen- 
sus block group characteristics, details of which can be 
found elsewhere [26]. 

Statistical methods 

Given the extensive information collected on potential 
predictors of melatonin levels, our initial approach was 
to use Breiman's random-forest-based (RF) variable im- 
portance measures, as described by Lunetta [52] to iden- 
tify the most important predictors of log 2 transformed, 
creatinine adjusted aMT6s from the large number of fac- 
tors for which we had information. (The aMT6s levels 
were highly skewed to the right, necessitating the log 2 
transformation.) This approach allowed us to compose a 
list of candidate variables, ordered by their relative im- 
portance, taking into account both multiple testing and 
potential interactions. The top ranking variables were 
then considered in linear models to further quantify pre- 
dictors of aMT6s concentrations. Final model selection 
via R's stepAIC function was chosen in a backwards 
elimination process using the Akaike information criter- 
ion (AIC) to quantify the model fit. 

We first focused on determining the most appropriate 
model specification for age. A plot, obtained by fitting a 
Generalized Additive Model (GAM) with an intercept 
and a spline term for age, demonstrated a decrease in 
aMT6s with increasing age, with a possible downward 
curvature at age 55. An analysis of variance to compare 
null, linear, and quadratic linear models for the regres- 
sion of the creatinine adjusted aMT6s concentrations on 
age, however, indicated that the best fitting model was 
one with age modeled as a simple linear term. Because 
restricting our analyses to women over age 55 resulted 
in a much stronger and significant linear effect for age 
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than observed in the full study population, we conducted 
our subsequent analyses on the full study sample, as well 
as among women over the age of 55. To address the high 
degree of collinearity between the SES variables, we 
conducted a principal components analysis (PCA) to cre- 
ate a composite measure of SES based on the five individ- 
ual components as described above. The loadings of the 
first principal component were then used as a composite 
measure in the random forests and regression models. 

Abbreviations 

aMT6s: 6-suIftoxymeIatonin; LAN: Light at night; CTS: California Teachers 
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